Войти
  • 63041Просмотров
  • 4 месяца назадОпубликованоJia-Bin Huang

The Weirdly Small AI That Cracks Reasoning Puzzles [HRM]

How can we build AI that can solve reasoning puzzles? A recent paper, "Hierarchical Reasoning Model," shocked the AI community with promising results on Sudoku, maze puzzles, and ARC-AGI benchmarks. This video provides an overview of the Hierarchical Reasoning Model. 00:00 Reasoning tasks 00:22 Hierarchical Reasoning Models' results 01:07 Problem setup 02:00 Transformer 02:37 Chian-of-thought reasoning 03:14 Recurrent models 04:31 HRM - Architecture 06:12 HRM - Gradient approximation 07:48 Specialized vs general models References: - Hierarchical Reasoning Model: - End-to-end Algorithm Synthesis with Recurrent Networks: Logical Extrapolation Without Overthinking - Scaling up test-time compute with latent reasoning: A recurrent depth approach: - Looped Transformers are Better at Learning Learning Algorithms, - Looped Transformers as Programmable Computers, Video made with Manim: